Abstract: A search for the Standard Model Higgs boson decaying into a pair of $\tau$ leptons is
reported. The analysis is based on a data sample of proton-proton collisions collected by the
ATLAS experiment at the LHC and corresponding to an integrated luminosity of 4.7 fb$^{-1}$. No
significant excess over the expected background is observed in the Higgs boson mass range of
100-150 GeV. The observed (expected) upper limits on the cross section times the branching
ratio for $H \to \tau^+\tau^-$ are found to be between 2.9 (3.4) and 11.7 (8.2) times the Standard
Model prediction for this mass range.
1 Introduction

The Higgs boson is the only fundamental particle in the Standard Model (SM) of particle 
physics that has not yet been observed. It is predicted by the Higgs mechanism [1-6], which 
in the SM gives mass to particles. The search for the Higgs boson is a centrepiece of the 
Large Hadron Collider (LHC) physics programme. 

An indirect constraint on the Higgs boson mass of $m_H < 185$ GeV at the 95% confidence
level (CL) has been set using global fits to electroweak precision data [7]. Direct searches at
LEP and the Tevatron have placed exclusion limits at 95% CL for $m_H < 114.4$ GeV and in
the region 147 GeV < $m_H$ < 179 GeV [8, 9], respectively. The results of searches in various
channels using data corresponding to an integrated luminosity of up to 5 fb$^{-1}$ have recently
been reported by both the ATLAS and CMS Collaborations [10, 11], excluding the mass range
between 112.9 GeV and 115.5 GeV and the region between 127 GeV and 600 GeV, again at
the 95% CL.

In the Higgs boson mass range 100-150 GeV, the $H \to \tau^+\tau^-$ decay mode is a promising
channel for the search at the LHC with branching ratios between 8% and 1.8%. The
$H \to \tau^+\tau^-$ search is complementary to searches with other decays in the same mass range and
enhances the overall sensitivity. Also, if the Higgs boson is discovered, the measurement of
the $H \to \tau^+\tau^-$ decay rate provides a test of the SM prediction for the $\tau$ Yukawa coupling.

The process with the largest cross section to produce a SM Higgs boson at the LHC is
gluon fusion $gg \to H$. However, Higgs boson production via vector boson (WW, ZZ) fusion
$qq \to qqH$ (VBF) and via Higgs-strahlung $qq \to VH$, in association with a hadronically
decaying vector boson (V = W or Z), are highly relevant as well because they lead to
additional jets in the final state, which provide distinct experimental signatures. In particular,
the VBF topology of two high-energy jets with a large rapidity separation offers a good
discrimination against background processes. For all production processes above, the Higgs
boson is typically more boosted in the transverse plane if there are additional high-$p_T$ jets
in the event, which increases the transverse momentum of the $\tau$ decay products and thus
facilitates the measurement of the $\tau^+\tau^-$ invariant mass and the discrimination of the signal
from background processes.

This paper presents SM Higgs boson searches in the $H \to \tau_{lep}^+\tau_{lep}^-$, $H \to \tau_{lep}^+\tau_{had}^-$, and
$H \to \tau_{had}^+\tau_{had}^-$ channels,¹ where $\tau_{lep}$ and $\tau_{had}$ denote leptonically and hadronically decaying
$\tau$ leptons, respectively. The data analyses use proton-proton (pp) collisions at $\sqrt{s} = 7$ TeV
collected by the ATLAS experiment in 2011, which correspond to an integrated luminosity
of 4.7 fb$^{-1}$. In order to enhance the sensitivity of the search, the selected events are analysed
in several separate categories according to the number and topology of reconstructed jets.

2 Data and Monte Carlo simulated samples 

The ATLAS detector is a multipurpose apparatus with a forward-backward symmetric cylindrical
geometry and nearly $4\pi$ coverage in solid angle [12]. It consists of an inner tracking
detector surrounded by a thin superconducting solenoid, electromagnetic and hadronic
calorimeters, and an external muon spectrometer incorporating three large superconducting
air-core toroid magnets. Electrons, muons, $\tau$ leptons and jets can be reconstructed and identified
in the ATLAS detector.² Only data taken with all sub-systems relevant to this analysis
operational are used. This results in an integrated luminosity of 4.7 fb$^{-1}$ for the full 2011
data sample.

¹ Charge-conjugated decay modes are implied. Throughout the remainder of the paper, a simplified notation
without the particle charges is used.

² ATLAS uses a right-handed coordinate system with its origin at the nominal interaction point (IP) in
the centre of the detector and the z-axis along the beam pipe. The x-axis points from the IP to the centre of
the LHC ring, and the y-axis points upwards. Cylindrical coordinates $(r, \phi)$ are used in the transverse plane,

The signal contributions considered here include the gluon fusion production process,
the VBF production process, and the Higgs-strahlung VH production in association with a
hadronically decaying vector boson V = W, Z. For the decay of the Higgs boson, the
$H \to \tau\tau$ mode is considered. Next-to-next-to-leading-order (NNLO) Quantum Chromodynamics
(QCD) corrections, soft-gluon resummations calculated in the next-to-next-to-leading-log
approximation and next-to-leading-order (NLO) electroweak (EW) corrections are applied
to the signal cross sections for the $gg \to H$ production process [13-24]. The cross sections
of the VBF process are calculated with full NLO QCD and electroweak corrections [25-27],
and approximate NNLO QCD corrections [28]. The WH/ZH processes are calculated with
NNLO QCD corrections [29, 30] and NLO EW radiative corrections are applied [31]. The
Higgs boson decay branching ratios are calculated using HDECAY [32]. The $gg \to H$ and
VBF processes are modelled using the POWHEG [33, 34] Monte Carlo (MC) generator,
interfaced to PYTHIA [35] for showering and hadronisation. In the $gg \to H$ process, the
Higgs boson $p_T$ spectrum is reweighted to agree with the prediction of the HqT program [36].
The associated VH production process is modelled using PYTHIA.

ALPGEN [37], interfaced to HERWIG [38], with the MLM matching scheme [39] is used to
model the production of single W and $Z/\gamma^*$ bosons decaying to charged leptons in association
with jets. MC@NLO [40] is used to model $t\bar{t}$, WW, WZ and ZZ production processes, using
HERWIG for the parton shower and hadronisation and JIMMY [41] for the underlying event
modelling. AcerMC [42] is used to model single top-quark production for all three production
channels (s-channel, t-channel, and Wt production).

The TAUOLA [43] and PHOTOS [44] programs are used to model the decay of $\tau$ leptons
and the Quantum Electrodynamics (QED) radiation of photons, respectively.

The set of parton distribution functions (PDF) CT10 [45] is used for the MC@NLO 
samples, as well as CTEQ6L1 [46] for the ALPGEN samples, and MRST2007 [47] for the 
PYTHIA and HERWIG samples. Acceptances and efficiencies are based on a simulation of 
the ATLAS detector using GEANT4 [48, 49]. Since the data are affected by the detector 
response to multiple interactions (pileup) occurring in the same or nearby bunch crossings, 
the simulation includes a treatment of the event pileup conditions present in the 2011 data. 

The $Z/\gamma^* \to \tau\tau$ background processes are modelled with a $\tau$-embedded $Z/\gamma^* \to \mu\mu$ data
sample as described in Section 6.

3 Selection and reconstruction of physics objects 

Electron candidates are formed from an energy deposit in the electromagnetic calorimeter
and associated with a track measured in the inner detector. They are selected if they have a
transverse energy $E_T > 15$ GeV, lie within $|\eta| < 2.47$ but outside the transition region between
the barrel and end-cap calorimeters ($1.37 < |\eta| < 1.52$), and meet quality requirements based
on the expected shower shape [50].

$\phi$ being the azimuthal angle around the beam pipe. The pseudorapidity is defined in terms of the polar angle
$\theta$ as $\eta = -\ln\tan(\theta/2)$.
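The pseudorapidity definition quoted in footnote 2 can be evaluated numerically. The following is a minimal sketch; the function names `pseudorapidity` and `polar_angle` are illustrative and not taken from any ATLAS software.

```python
import math


def pseudorapidity(theta: float) -> float:
    """Return eta = -ln(tan(theta/2)) for a polar angle theta in radians."""
    if not 0.0 < theta < math.pi:
        raise ValueError("theta must lie strictly between 0 and pi")
    return -math.log(math.tan(theta / 2.0))


def polar_angle(eta: float) -> float:
    """Inverse relation: theta = 2 * atan(exp(-eta))."""
    return 2.0 * math.atan(math.exp(-eta))


# A direction perpendicular to the beam has eta close to 0; the electron
# acceptance edge |eta| = 2.47 corresponds to a polar angle of about 9.7 degrees.
edge_angle_deg = math.degrees(polar_angle(2.47))
```

Large values of $|\eta|$ correspond to directions close to the beam pipe, which is why acceptance requirements are usually expressed as upper bounds on $|\eta|$.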

Muon candidates are formed from a track measured in the inner detector and linked to
a track in the muon spectrometer [51]. They are required to have a transverse momentum
$p_T > 10$ GeV and to lie within $|\eta| < 2.5$. Additionally, the difference between the z-position
of the point of closest approach of the muon inner detector track to the beam-line and the
z-coordinate of the primary vertex is required to be less than 1 cm.³ This requirement reduces
the contamination due to cosmic ray muons and beam-induced backgrounds. Muon quality
criteria based on, e.g., inner detector hit requirements are applied in order to achieve a precise
measurement of the muon momentum and reduce the misidentification rate.

Identified electrons and muons are required to be isolated: the additional transverse
energy in the electromagnetic and hadronic calorimeters must be less than 8% (4%) of
the electron transverse energy (muon transverse momentum) in a cone of radius
$\Delta R = \sqrt{(\Delta\eta)^2 + (\Delta\phi)^2} = 0.2$ around the electron (muon) direction. The sum of the transverse
momenta of all tracks with $p_T$ above 1 GeV located within a cone of radius $\Delta R = 0.4$ around
the electron (muon) direction and originating from the same primary vertex must be less than
6% of the electron transverse energy (muon transverse momentum).
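The isolation requirements above can be illustrated with a short sketch. The data layout (tuples of $p_T$, $\eta$, $\phi$) and all function names are assumptions made for this example. It is also assumed that the input tracks already come from the same primary vertex and that the lepton's own track has been removed by the caller, which the text does not specify.

```python
import math

# Thresholds quoted in the text: calorimeter isolation in a cone of 0.2,
# track isolation in a cone of 0.4 using tracks with pT above 1 GeV.
CALO_FRACTION = {"electron": 0.08, "muon": 0.04}
TRACK_FRACTION = 0.06


def delta_phi(phi1, phi2):
    """Azimuthal difference wrapped into the interval [-pi, pi)."""
    return (phi1 - phi2 + math.pi) % (2.0 * math.pi) - math.pi


def delta_r(eta1, phi1, eta2, phi2):
    """Delta R = sqrt(delta_eta^2 + delta_phi^2)."""
    return math.hypot(eta1 - eta2, delta_phi(phi1, phi2))


def cone_track_pt_sum(lep_eta, lep_phi, tracks, radius=0.4, min_pt=1.0):
    """Sum the pT (GeV) of tracks (pt, eta, phi) inside the cone."""
    return sum(
        pt
        for pt, eta, phi in tracks
        if pt > min_pt and delta_r(lep_eta, lep_phi, eta, phi) < radius
    )


def is_isolated(flavour, lepton_scale, calo_et_cone02, track_pt_sum_cone04):
    """lepton_scale is the electron E_T or the muon p_T in GeV."""
    return (
        calo_et_cone02 < CALO_FRACTION[flavour] * lepton_scale
        and track_pt_sum_cone04 < TRACK_FRACTION * lepton_scale
    )
```

For a 50 GeV electron, for example, the calorimeter sum must stay below 4 GeV and the track sum below 3 GeV; the same energy flow around a 50 GeV muon would have to satisfy the tighter 2 GeV calorimeter bound.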

Jets are reconstructed using the anti-$k_t$ algorithm [52] with a distance parameter value
of R = 0.4, taking as input three-dimensional noise-suppressed clusters in the calorimeters.
Reconstructed jets with $p_T > 20$ GeV and within $|\eta| < 4.5$ are selected. Events are discarded
if a jet is associated with out-of-time activity or calorimeter noise. After having associated
tracks to jets by requiring $\Delta R < 0.4$ between tracks and the jet direction, a jet-vertex fraction
(JVF) is computed for each jet as the scalar $p_T$ sum of all associated tracks from the primary
vertex divided by the scalar $p_T$ sum of all tracks associated with the jet. Conventionally,
JVF = -1 is assigned to jets with no associated tracks. Jets with $|\eta| < 2.4$ are required to
have |JVF| > 0.75 in order to suppress pileup contributions. In the pseudorapidity range
$|\eta| < 2.5$, b-jets are identified using a tagging algorithm based on the discrimination power of
the impact parameter information and of the reconstruction of the displaced vertices of the
hadron decays inside the jets [53]. The b-tagging algorithm has an average efficiency of 58%
for b-jets in $t\bar{t}$ events [54]. The corresponding light-quark jet misidentification probability is
0.1-0.5%, depending on the jet $p_T$ and $\eta$ [55].
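The jet-vertex fraction described above can be sketched as follows. The inputs are assumed to be the transverse momenta of the tracks already matched to the jet within $\Delta R < 0.4$, together with a flag saying whether each track comes from the primary vertex; both names are hypothetical.

```python
def jet_vertex_fraction(track_pts, from_primary_vertex):
    """Scalar pT sum of primary-vertex tracks over the scalar pT sum of all
    tracks associated with the jet; -1 by convention if there are no tracks."""
    if not track_pts:
        return -1.0
    total = sum(track_pts)
    from_pv = sum(
        pt for pt, is_pv in zip(track_pts, from_primary_vertex) if is_pv
    )
    return from_pv / total


def passes_jvf(jet_abs_eta, jvf):
    """Apply |JVF| > 0.75 only to jets with |eta| < 2.4, as in the text."""
    if jet_abs_eta >= 2.4:
        return True
    return abs(jvf) > 0.75
```

Because the requirement is placed on |JVF|, a central jet without any associated track (JVF = -1) is retained rather than rejected.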

Hadronic decays of $\tau$ leptons are characterised by the presence of one or three charged
hadrons accompanied by a neutrino and possibly neutral hadrons, which results in a collimated
shower profile in the calorimeters and only a few nearby tracks. The visible decay products
are combined into $\tau_{had}$ candidates. These candidates are reconstructed as jets, which are
re-calibrated to account for the different calorimeter response to hadronic decays as compared to
hadronic jets. The four-momenta of the $\tau_{had}$ candidates are reconstructed from the energy
deposits in the calorimeters, and the rejection of jets misidentified as hadronic $\tau$ decays is
performed by a multivariate discriminator based on a boosted decision tree [56] that uses
both tracking and calorimeter information. The identification is optimised to be 50% efficient
while the jet misidentification probability is kept below 1%. A $\tau_{had}$ candidate must lie within
$|\eta| < 2.5$, have a transverse momentum greater than 20 GeV, one or three associated tracks
(with $p_T > 1$ GeV) and a total charge of ±1 computed from the associated tracks. Dedicated
electron and muon veto algorithms are used.

³ The primary vertex is defined as the vertex with the largest $\sum p_T^2$ of the associated tracks.

When different objects selected according to the above criteria overlap with each other
geometrically (within $\Delta R < 0.2$), only one of them is considered for further analysis. The
overlap is resolved by selecting muon, electron, $\tau_{had}$ and jet candidates in this order of priority.
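The overlap resolution can be sketched as a priority-ordered pass over the selected objects. The dictionary layout and the names `PRIORITY` and `remove_overlaps` are assumptions for illustration. The sketch also assumes that only objects of different types are compared, since the text refers to "different objects" without stating how two candidates of the same type are treated.

```python
import math

# Priority order quoted in the text: muon, electron, tau_had, jet.
PRIORITY = ("muon", "electron", "tau_had", "jet")


def _delta_r(a, b):
    """Delta R between two objects carrying 'eta' and 'phi' entries."""
    dphi = (a["phi"] - b["phi"] + math.pi) % (2.0 * math.pi) - math.pi
    return math.hypot(a["eta"] - b["eta"], dphi)


def remove_overlaps(objects, max_dr=0.2):
    """Keep an object unless a higher-priority object of another type
    has already been kept within Delta R < max_dr."""
    kept = []
    for obj in sorted(objects, key=lambda o: PRIORITY.index(o["kind"])):
        if all(
            other["kind"] == obj["kind"] or _delta_r(obj, other) >= max_dr
            for other in kept
        ):
            kept.append(obj)
    return kept
```

A jet reconstructed from the same calorimeter deposits as a $\tau_{had}$ candidate is thus dropped in favour of the $\tau_{had}$ candidate, while an isolated jet elsewhere in the event is unaffected.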

The magnitude of the missing transverse momentum [57] ($E_T^{miss}$) is reconstructed
including contributions from muon tracks and energy deposits in the calorimeters. Calorimeter
cells belonging to three-dimensional noise-suppressed clusters are used and they are calibrated
taking into account the reconstructed physics object to which they belong.

4 Preselection 

An initial selection of events is performed by requiring a vertex from the primary pp collisions
that is consistent with the beam spot position, with at least three associated tracks, each
with $p_T > 500$ MeV. Overall quality criteria are applied to suppress events with fake $E_T^{miss}$,
produced by non-collision activity such as cosmic ray muons, beam-related backgrounds, or
noise in the calorimeters.

The $\ell\ell$, $\ell\tau_{had}$ and $\tau_{had}\tau_{had}$ final states⁴ considered in this search are defined in a mutually
exclusive way: a requirement of exactly two, one, or zero electrons or muons is imposed,
respectively.
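The mutually exclusive channel definition amounts to a lookup on the number of selected light leptons. This is a minimal sketch with hypothetical channel labels; the additional channel-specific requirements of Sections 4.1 to 4.3 are not included.

```python
def classify_channel(n_light_leptons):
    """Map the number of selected electrons plus muons to a final state.
    Returns None for events with more than two light leptons."""
    channels = {2: "lep-lep", 1: "lep-had", 0: "had-had"}
    return channels.get(n_light_leptons)
```

Since each event has exactly one lepton count, no event can enter more than one of the three final states.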

4.1 $H \to \tau_{lep}\tau_{lep}$

Signal events in this channel are selected by requiring exactly two isolated and oppositely
charged light leptons (electrons and/or muons). Single lepton and di-lepton triggers are used
to preselect the data. The trigger object quality requirements were tightened during the
data-taking period to cope with increasing instantaneous luminosity. The single muon trigger
requires $p_T > 18$ GeV; for the single electron trigger the $E_T$ threshold changes from 20 GeV
to 22 GeV depending on the LHC instantaneous luminosity; the di-muon trigger requires
$p_T > 15$ GeV for the leading muon and $p_T > 10$ GeV for the sub-leading muon; the
di-electron trigger requires $E_T > 12$ GeV for each of the two electrons; the $e\mu$ trigger requires
$E_T > 10$ GeV for the electron and $p_T > 6$ GeV for the muon. In addition to the trigger
requirements, the preselection requires $E_T > 22$ GeV if the electron satisfies only the single
electron trigger. The $E_T$ requirement is increased to 24 GeV when the trigger threshold is
22 GeV. If a muon is associated only with the single muon trigger object, it is required to have
$p_T > 20$ GeV. For the $e\mu$ channel the di-lepton invariant mass is required to be in the range
of 30 GeV < $m_{\ell\ell}$ < 100 GeV, whereas for the $ee$ and $\mu\mu$ channels 30 GeV < $m_{\ell\ell}$ < 75 GeV
is required, reducing the contamination from $Z/\gamma^* \to \ell\ell$.

⁴ Here $\ell$ denotes an electron or a muon (also referred to below as light leptons).
4.2 $H \to \tau_{lep}\tau_{had}$

Signal events in this channel are characterised by exactly one isolated light lepton $\ell$, a $\tau_{had}$
candidate, and large $E_T^{miss}$ due to the undetected neutrinos. For the $e\tau_{had}$ ($\mu\tau_{had}$) final
states, events are preselected using the single electron (muon) trigger described in Section 4.1.
Exactly one electron with $E_T > 25$ GeV or one muon with $p_T > 20$ GeV, and one oppositely
charged $\tau_{had}$ candidate with $p_T > 20$ GeV are required in the event. Events with more than
one electron or muon candidate are rejected to suppress events from $Z/\gamma^* \to \ell^+\ell^-$ decays
and from $t\bar{t}$ or single top-quark production.⁵ The transverse mass of the lepton and $E_T^{miss}$ is
calculated as

$$m_T = \sqrt{2 p_T^{\ell} E_T^{miss} (1 - \cos\Delta\phi)}, \qquad (4.1)$$

where $p_T^{\ell}$ denotes the magnitude of the transverse momentum of the lepton and $\Delta\phi$ is the angle
between the lepton and $E_T^{miss}$ directions in the plane perpendicular to the beam direction.
In order to reduce contributions from the W+jets and $t\bar{t}$ background processes, only events
with $m_T < 30$ GeV are considered for further analysis. In addition, $E_T^{miss}$ is used for further
event selection and categorisation, as described in Section 5.
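Equation (4.1) translates directly into code. The sketch below assumes momenta in GeV and angles in radians; the function name and argument names are illustrative.

```python
import math


def transverse_mass(lepton_pt, lepton_phi, met, met_phi):
    """Transverse mass of the lepton and the missing transverse momentum,
    following Eq. (4.1)."""
    dphi = lepton_phi - met_phi
    return math.sqrt(2.0 * lepton_pt * met * (1.0 - math.cos(dphi)))


# Example: a 25 GeV lepton and 20 GeV of missing transverse momentum
# separated by 0.5 rad give a transverse mass of about 11 GeV,
# which would pass the requirement m_T < 30 GeV.
example_mt = transverse_mass(25.0, 0.0, 20.0, 0.5)
```

The transverse mass vanishes when the lepton and the missing momentum are collinear and is largest when they are back to back, which is the typical configuration for W decays; this is why an upper bound on $m_T$ suppresses the W+jets background.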

4.3 $H \to \tau_{had}\tau_{had}$

Signal events in this channel are characterised by two identified hadronic $\tau$ decays and large
$E_T^{miss}$ from the undetected neutrinos. The corresponding event selection starts with a double
hadronic $\tau$ trigger, where the $p_T$ thresholds are 29 GeV and 20 GeV for the leading and
sub-leading hadronic $\tau$ objects, respectively. A requirement of exactly zero charged light
leptons, as defined in Section 3, is imposed. Two identified opposite-charge $\tau_{had}$ candidates
with $p_T > 35$ GeV and $p_T > 25$ GeV are required, each matching a $\tau$ trigger object [58].

5 Analysis categories 

For further analysis, the selected event samples are split into several categories according to
the number and topology of reconstructed jets. The sensitivity of the search is usually higher
for categories where the presence of one or more jets is required, as discussed in Section 1,
but events without any reconstructed high-$p_T$ jets are also considered in order to maximise
the sensitivity.

5.1 $H \to \tau_{lep}\tau_{lep}$

Four categories defined by their jet multiplicity and kinematics are used for this channel:
H+2-jet VBF, H+2-jet VH, H+1-jet and H+0-jet. The first two categories require the
presence of at least two jets and the cuts are optimised in one case for the VBF process [59-61],
and in the other for the VH and $gg \to H$ processes [62].

⁵ For the purpose of vetoing additional light leptons, the isolation requirements are removed from the muon
selection and a looser identification requirement is used for electrons. The transverse momentum (energy) 
threshold for muons (electrons) is lowered to 10 GeV (15 GeV). 
The H+0-jet category uses an inclusive selection to collect part of the signal not selected
by the categories with jets. In the H+0-jet category, only the $e\mu$ final state is considered
because of the overwhelming $Z/\gamma^* \to \ell\ell$ background in the $ee$ and $\mu\mu$ final states. In order
to suppress the $t\bar{t}$ background, it is required that the di-lepton azimuthal opening angle be
$\Delta\phi_{\ell\ell} > 2.5$ rad and that the leptonic transverse energy be $H_T^{lep} = p_T^{\ell 1} + p_T^{\ell 2} + E_T^{miss} < 120$ GeV,
where $p_T^{\ell 1}$ and $p_T^{\ell 2}$ are the magnitudes of the transverse momenta of the leading and sub-leading
leptons, respectively.

In categories with jets (H+2-jet VBF, H+2-jet VH and H+1-jet), the presence of a
hadronic jet with a transverse momentum $p_T > 40$ GeV is required and, to suppress the $t\bar{t}$
background, the event is rejected if any jet with $p_T > 25$ GeV is identified as a b-jet. In
addition, $E_T^{miss} > 40$ GeV ($E_T^{miss} > 20$ GeV) for the $ee$, $\mu\mu$ ($e\mu$) channels is also required.

The collinear approximation technique [63] is used to reconstruct the kinematics of the
$\tau\tau$ system. The approximation is based on two assumptions: that the neutrinos from each
$\tau$ decay are nearly collinear with the corresponding visible $\tau$ decay products and that the
$E_T^{miss}$ in the event is due only to neutrinos. In this case, the total invisible momentum carried
away by neutrinos in each $\tau$ decay can be estimated from the polar and azimuthal angles
of the visible products of each $\tau$ decay. Then, the invariant mass of the $\tau\tau$ system can
be calculated as $m_{\tau\tau} = m_{\ell\ell}/\sqrt{x_1 \cdot x_2}$, where $x_1$ and $x_2$ are the momentum fractions of the
two $\tau$ candidates carried away by their visible decay products. Events that do not satisfy
$0.1 < x_1, x_2 < 1.0$ are rejected. In categories with jets, there is an additional requirement
that 0.5 rad < $\Delta\phi_{\ell\ell}$ < 2.5 rad to suppress the $Z/\gamma^* \to \ell\ell$ background.
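The collinear approximation can be sketched in the transverse plane. Under the two assumptions above, the missing transverse momentum is a sum of two vectors pointing along the visible decay products, $\vec{E}_T^{miss} = a_1 \vec{p}_{T,1} + a_2 \vec{p}_{T,2}$, so that $x_i = 1/(1 + a_i)$. This is a minimal illustration with hypothetical function names, not the implementation used in the analysis.

```python
import math


def collinear_fractions(p1x, p1y, p2x, p2y, met_x, met_y):
    """Solve met = a1 * p1 + a2 * p2 in the transverse plane and return the
    visible momentum fractions (x1, x2). Returns None when the two visible
    directions are parallel or back to back, or when a fraction is undefined."""
    det = p1x * p2y - p1y * p2x
    if det == 0.0:
        return None
    # Cramer's rule for the two invisible-to-visible momentum ratios.
    a1 = (met_x * p2y - met_y * p2x) / det
    a2 = (p1x * met_y - p1y * met_x) / det
    if a1 == -1.0 or a2 == -1.0:
        return None
    return 1.0 / (1.0 + a1), 1.0 / (1.0 + a2)


def collinear_mass(m_vis, x1, x2, x_min=0.1, x_max=1.0):
    """m_tautau = m_vis / sqrt(x1 * x2); None if the event fails
    x_min < x1, x2 < x_max (0.1 and 1.0 in the text)."""
    if not (x_min < x1 < x_max and x_min < x2 < x_max):
        return None
    return m_vis / math.sqrt(x1 * x2)
```

The determinant vanishes when the two leptons are back to back, which is the reason the text later replaces the collinear mass by the effective mass in the H+0-jet category.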

For the H+2-jet categories, a sub-leading jet with $p_T > 25$ GeV is required in addition.
For the H+2-jet VBF category, a pseudorapidity difference between the two selected jets of
$\Delta\eta_{jj} > 3$ and a di-jet invariant mass of $m_{jj} > 350$ GeV are required. Finally, the event is
rejected in the H+2-jet VBF category if any additional jet with $p_T > 25$ GeV and $|\eta| < 2.4$
is found in the pseudorapidity range between the two leading jets.

For the H+2-jet VH category, the requirements on the pseudorapidity separation of the
jets and on the di-jet invariant mass are instead $\Delta\eta_{jj} < 2$ and 50 GeV < $m_{jj}$ < 120 GeV.
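The jet requirements of the two H+2-jet categories can be sketched as below. Jets are represented as dictionaries with `pt` (GeV), `eta` and `phi`, and the di-jet mass is computed for massless jets; both choices are assumptions of this example. The b-jet veto, the $E_T^{miss}$ requirement and the lepton selection are deliberately left out.

```python
import math


def dijet_mass(j1, j2):
    """Invariant mass of two jets, treating them as massless (assumption)."""
    return math.sqrt(
        2.0 * j1["pt"] * j2["pt"]
        * (math.cosh(j1["eta"] - j2["eta"]) - math.cos(j1["phi"] - j2["phi"]))
    )


def two_jet_category(jets):
    """Return 'VBF', 'VH' or None based on the jet requirements in the text."""
    jets = sorted(jets, key=lambda j: j["pt"], reverse=True)
    if len(jets) < 2 or jets[0]["pt"] <= 40.0 or jets[1]["pt"] <= 25.0:
        return None
    j1, j2 = jets[0], jets[1]
    deta = abs(j1["eta"] - j2["eta"])
    mjj = dijet_mass(j1, j2)
    lo, hi = sorted((j1["eta"], j2["eta"]))
    # Veto on additional jets between the two leading jets (VBF only).
    central_jet = any(
        j["pt"] > 25.0 and abs(j["eta"]) < 2.4 and lo < j["eta"] < hi
        for j in jets[2:]
    )
    if deta > 3.0 and mjj > 350.0 and not central_jet:
        return "VBF"
    if deta < 2.0 and 50.0 < mjj < 120.0:
        return "VH"
    return None
```

The two sets of requirements cannot be satisfied simultaneously because the $\Delta\eta_{jj}$ ranges do not overlap, so the categories are mutually exclusive by construction.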

Only events failing the cuts for the H+2-jet categories are considered in the H+1-jet
category. For the H+1-jet category, the invariant mass of the two $\tau$ leptons and the leading
jet is required to fulfil $m_{\tau\tau j} > 225$ GeV, where the $\tau$ momenta are taken from the collinear
approximation. The main Higgs production mechanism in this category is the $gg \to H$ process
plus a high-$p_T$ parton.

The $m_{\tau\tau}$ calculated with the collinear approximation ("collinear mass") is used in categories
with jets. Because this variable displays poor resolution in the H+0-jet category due
to the back-to-back configuration of the two leptons, the effective mass ($m_{\tau\tau}^{eff}$), defined as the
invariant mass of the two leptons and the $E_T^{miss}$, is used instead.

5.2 $H \to \tau_{lep}\tau_{had}$

The selected data are split into seven categories based on jet properties and $E_T^{miss}$.
The H+2-jet VBF category includes all selected events with $E_T^{miss} > 20$ GeV and at least
two jets with $p_T > 25$ GeV, where the two leading jets are found in opposite hemispheres of
the detector ($\eta_{jet1} \cdot \eta_{jet2} < 0$), with $\Delta\eta_{jj} > 3$ and $m_{jj} > 300$ GeV. Both the lepton and the
$\tau_{had}$ candidate are required to be found in the pseudorapidity range between the two leading
jets. Due to the limited size of the selected event samples, the VBF category combines the
$e\tau_{had}$ and $\mu\tau_{had}$ final states.

Two H+1-jet categories include all selected events with $E_T^{miss} > 20$ GeV and at least
one jet with $p_T > 25$ GeV that fail the VBF selection. The $e\tau_{had}$ and $\mu\tau_{had}$ final states are
considered separately.

Four H+0-jet categories include all selected events without any jet with $p_T > 25$ GeV.
The $e\tau_{had}$ and $\mu\tau_{had}$ final states are considered separately. In addition, the analysis is separated
into events with $E_T^{miss} > 20$ GeV and $E_T^{miss} < 20$ GeV. The low-$E_T^{miss}$ region is included
here because, in the absence of high-$p_T$ jets, the Higgs decay products, including the neutrinos,
are typically less boosted than for events with additional jet activity.

For each category, the mass of the $\tau\tau$ system is reconstructed using the Missing Mass
Calculator (MMC) [64]. This technique provides a full reconstruction of event kinematics
in the $\tau\tau$ final state with 99% efficiency and 13-20% resolution in $m_{\tau\tau}$, depending on the
event topology (better resolution is obtained for events with high-$p_T$ jets). Conceptually, the
MMC is a more sophisticated version of the collinear approximation. The main improvement
comes from requiring that relative orientations of the neutrinos and other decay products are
consistent with the mass and kinematics of a $\tau$ lepton decay. This is achieved by maximising
a probability defined in the kinematically allowed phase space region.

5.3 $H \to \tau_{had}\tau_{had}$

In the $H \to \tau_{had}\tau_{had}$ channel, only a single H+1-jet category is defined. After selecting
two hadronic $\tau$ candidates, the collinear mass approximation cuts $0 < x_1, x_2 < 1$ are applied.
Events at this stage are used as a control sample to derive the normalisation of the $Z/\gamma^* \to \tau\tau$
background. Then, events are selected if $E_T^{miss} > 20$ GeV and if the leading jet has a
transverse momentum $p_T > 40$ GeV. The two $\tau$ candidates are required to be separated by
$\Delta R(\tau, \tau) < 2.2$. Also, only events with an invariant mass of the $\tau\tau$ pair and the leading jet
$m_{\tau\tau j} > 225$ GeV are considered for further analysis. The event selection criteria described
here are effective against the multi-jet and $Z/\gamma^* \to \tau\tau$ backgrounds. The collinear mass
approximation is used for the Higgs mass reconstruction.

6 Background estimation and modelling 

The background composition and normalisation are determined using data-driven methods 
and the simulated event samples described in Section 2. 

The main background to the Higgs boson signal in all selected final states is the largely
irreducible $Z/\gamma^* \to \tau\tau$ process. While it is not possible to select a Higgs signal-free
$Z/\gamma^* \to \tau\tau$ sample directly from the data, this background is still modelled in a data-driven way, by
choosing a control sample where the expected signal contamination is negligible. In a sample
of selected $Z/\gamma^* \to \mu\mu$ data events, the muon tracks and associated calorimeter cells are
replaced by $\tau$ leptons from a simulated $Z/\gamma^* \to \tau\tau$ decay with the same kinematics, where
the $\tau$ polarisation and spin correlations are modelled with the TAUOLA program and the $\tau$-$\mu$
mass difference is taken into account as well. Thus, only the $\tau$ decays and the corresponding
detector response are taken from the simulation, whereas the underlying event kinematics and
all other properties, including pileup effects, are obtained from the data. These embedded
data are used to model the shape of the relevant $Z/\gamma^* \to \tau\tau$ background distributions as well
as the efficiency for selecting $Z/\gamma^* \to \tau\tau$ events from the preselected sample. The overall
normalisation at the preselection level is obtained from simulation.

Figure 1. (a) $E_T^{miss}$ distributions for the $Z/\gamma^* \to \mu\mu$ data before and after muon embedding and (b)
MMC mass distributions (defined in Section 5) for the $\tau$-embedded $Z/\gamma^* \to \mu\mu$ data and simulated
$Z/\gamma^* \to \tau\tau$ events. For (a) only statistical uncertainties are shown; (b) also includes systematic
uncertainties associated with the embedding procedure as discussed in Section 7.

The procedure is extensively validated: example results for the $H \to \tau_{lep}\tau_{had}$ channel
are shown in Figure 1. Systematic effects intrinsic to the method are studied by replacing
the muons selected in data by simulated muons instead of $\tau$ decays. Figure 1(a) shows a
comparison of the $E_T^{miss}$ distributions from the selected $Z/\gamma^* \to \mu\mu$ events in data before and
after this muon embedding, demonstrating that the embedding procedure does not introduce
any significant bias to the reconstruction of the event properties. Figure 1(b) compares the
MMC mass distributions reconstructed from the $\tau$-embedded $Z/\gamma^* \to \mu\mu$ data and simulated
$Z/\gamma^* \to \tau\tau$ events after the preselection described in Section 4; good agreement is found and
similar studies for the other channels yield the same conclusions.
6.1 $H \to \tau_{lep}\tau_{lep}$

The $Z/\gamma^* \to \tau\tau$ background is modelled using the embedding procedure described above.
The contribution from $Z/\gamma^* \to \ell^+\ell^-$ is determined by scaling the yields in the Monte Carlo
simulation using correction factors obtained by comparing data to simulation in low- and
high-$E_T^{miss}$ control regions enriched in these backgrounds. The correction factors are obtained
separately for $Z/\gamma^* \to ee$ and $Z/\gamma^* \to \mu\mu$ and for the different analysis categories and are
on the order of 10%.

The fake lepton background consists of events that have a reconstructed lepton that did 
not originate from the decay of a r lepton or the leptonic decay of a W or Z boson. The 
normalisation and shape of relevant distributions are obtained from data with a template 
method using a control region in which the lepton isolation requirement is reversed. The 
chosen template shape is the $p_T$ distribution of the sub-leading lepton. For this method to
be applied, it is first verified that the template shapes of the fake lepton distribution in the 
control and signal regions agree within uncertainties. This is performed at intermediate steps 
of the event selection where the data sample is dominated by background events and where 
the number of expected signal events is negligible. 

After subtracting the simulated backgrounds, the template shape in a given distribution
is obtained from the control region, while the normalisation is obtained from a fit of the
distribution of the events in the signal region with the template shape. The uncertainty related
to the estimation of backgrounds with fake leptons is calculated from the uncertainty on the
subtraction of other processes from Monte Carlo simulation and from the difference in the
$p_T$ shape of the events in the control and signal regions. Such systematic uncertainties
lie in the range of 30-40%.

The contributions of the $t\bar{t}$, single top-quark and electroweak di-boson backgrounds are
estimated from simulation. The Monte Carlo description of the top-quark backgrounds has
been validated using data by selecting control regions enriched in top-quark background
processes. The control regions are defined by inverting the b-jet selection for the H+2-jet
VBF, H+2-jet VH and H+1-jet categories and by inverting the $H_T^{lep}$ selection for the H+0-jet
category.

Table 1 displays the number of events expected and observed in the four categories 
after all selection criteria including all systematic uncertainties described in Section 7. The 
estimated combined background contributions are found to give a good description of all 
quantities relevant to the analysis. As examples, the distributions of the jet multiplicity, 
$E_T^{miss}$, the invariant mass of the two leading jets and their pseudorapidity difference are
shown in Figure 2. Figure 3 displays the invariant mass spectra of the selected events for the 
four categories. 
Table 1. Number of events after the $H \to \tau_{lep}\tau_{lep}$ selection for the four categories in data and
predicted number of background events, for an integrated luminosity of 4.7 fb$^{-1}$. Expectations for
the Higgs boson signal ($m_H$ = 120 GeV) are also given. Statistical and systematic uncertainties are
quoted, in that order.

                                                $ee + \mu\mu + e\mu$      $ee + \mu\mu + e\mu$      $ee + \mu\mu + e\mu$      $e\mu$
                                                H+2-jet VBF               H+2-jet VH                H+1-jet                   H+0-jet
$gg \to H$ signal                               0.26±0.06±0.10            0.8±0.1±0.2               3.9±0.2±1.0               23±1±3
VBF $H$ signal                                  1.08±0.03±0.11            0.10±0.01±0.01            1.15±0.03±0.01            0.75±0.03±0.06
$VH$ signal                                     0.01±0.01±0.01            0.53±0.02±0.07            0.40±0.02±0.03            0.52±0.02±0.04
$Z/\gamma^* \to \tau^+\tau^-$                   24±3±2                    107±12±9                  (0.52±0.01±0.04)×10$^3$   (9.68±0.05±0.07)×10$^3$
$Z/\gamma^* \to \ell^+\ell^-$ ($\ell = e, \mu$) 2±1±1                     25±4±9                    83±10±30                  185±11±14
$t\bar{t}$ + single top                         7±1±2                     42±2±6                    98±3±12                   169±4±14
WW/WZ/ZZ                                        0.9±0.3±0.3               6±1±1                     21±1±3                    221±3±18
Fake leptons                                    1.3±0.8±0.6               13±2±5                    30±4±12                   (1.2±0.5)×10$^3$
Total background                                35±3±4                    193±7±20                  (0.75±0.01±0.05)×10$^3$   (11.4±0.5)×10$^3$
Observed data                                   27                        185                       702                       11420
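As a simple arithmetic cross-check of Table 1, the central values of the individual background contributions can be summed per category and compared with the quoted totals. The sketch below only uses numbers from the table; uncertainties and their correlations are ignored, and the dictionary keys are shorthand labels chosen for this example.

```python
# Central values per category, in the order
# (H+2-jet VBF, H+2-jet VH, H+1-jet, H+0-jet).
backgrounds = {
    "Z/gamma* -> tau tau": (24, 107, 520, 9680),
    "Z/gamma* -> ll": (2, 25, 83, 185),
    "ttbar + single top": (7, 42, 98, 169),
    "WW/WZ/ZZ": (0.9, 6, 21, 221),
    "Fake leptons": (1.3, 13, 30, 1200),
}
quoted_total = (35, 193, 750, 11400)
observed = (27, 185, 702, 11420)

# Column-wise sum of the individual background estimates.
summed = tuple(sum(column) for column in zip(*backgrounds.values()))

# Relative difference between the summed and the quoted totals.
relative_diff = tuple(abs(s - q) / q for s, q in zip(summed, quoted_total))
```

The summed central values agree with the quoted totals to better than 1% in every category, which is consistent with rounding of the individual entries.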
Figure 2. Distributions of (a) the jet multiplicity, (b) the $E_T^{miss}$, (c) the invariant mass of the
two leading jets and (d) the $\eta$ difference of the two leading jets in the $H \to \tau_{lep}\tau_{lep}$ channel for the
selection criteria described in the text. Simulated samples are normalised to an integrated luminosity of
4.7 fb$^{-1}$. For illustration only, the signal contributions have been scaled by factors given in the legends.
30 GeV < $m_{\ell\ell}$ < 100 GeV is required for the $ee$ and $\mu\mu$ channels and 30 GeV < $m_{\ell\ell}$ < 75 GeV for
the $e\mu$ channel. For the $E_T^{miss}$ distribution the presence of a leading jet with $p_T > 40$ GeV is required;
for the invariant mass and for the $\eta$ difference of the two leading jets, the presence of a sub-leading
jet with $p_T > 25$ GeV is required in addition.
Figure 3. Reconstructed $m_{\tau\tau}$ of the selected events in the $H \to \tau_{\mathrm{lep}}\tau_{\mathrm{lep}}$ channel for the four categories
described in the text. Simulated samples are normalised to an integrated luminosity of 4.7 fb$^{-1}$.
Predictions from the Higgs boson signal ($m_H$ = 120 GeV) and from backgrounds are given. In the
case of the $H$+0-jet category $m_{\tau\tau}^{\mathrm{eff}}$ is used. For illustration only, the signal contributions have been
scaled by factors given in the legends.






6.2 $H \to \tau_{\mathrm{lep}}\tau_{\mathrm{had}}$

In order to estimate the background contributions to the selected $H \to \tau_{\mathrm{lep}}\tau_{\mathrm{had}}$ candidate
events, a control sample in data is obtained by applying the signal selection described in
Section 5.2 but now requiring that the light lepton and the $\tau_{\mathrm{had}}$ candidate have the same
charge. This control sample is referred to as the same-sign (SS) sample here, in contrast to
the opposite-sign (OS) signal sample. The number of OS background events in the signal
region ($n_{\mathrm{OS}}^{\mathrm{bkg}}$) can be expressed as

$$ n_{\mathrm{OS}}^{\mathrm{bkg}} = n_{\mathrm{SS}}^{\mathrm{all}} + n_{\mathrm{OS-SS}}^{W+\mathrm{jets}} + n_{\mathrm{OS-SS}}^{Z \to \tau\tau} + n_{\mathrm{OS-SS}}^{\mathrm{other}}, \qquad (6.1) $$

where $n_{\mathrm{SS}}^{\mathrm{all}}$ is the sum of all SS backgrounds in the signal region and the remaining terms are
the differences between the number of OS and SS events for $W$+jets, $Z/\gamma^{*} \to \tau\tau$ and other
backgrounds, respectively. Due to their large production cross sections, multi-jet processes
provide a significant background if quark/gluon jets are misidentified as hadronic $\tau$ decays.
The ratio of OS to SS events for the multi-jet background ($r_{\mathrm{OS/SS}}^{\mathrm{QCD}}$) is expected to be close
to unity and therefore $n_{\mathrm{OS-SS}}^{\mathrm{QCD}} = 0$ is assumed. This assumption is validated with a control
sample that is dominated by low-$p_{\mathrm{T}}$ jets from multi-jet processes. This sample is selected by
replacing the requirement $E_{\mathrm{T}}^{\mathrm{miss}}$ > 20 GeV with $E_{\mathrm{T}}^{\mathrm{miss}}$ < 15 GeV and removing the isolation
criteria of the electron or muon candidate. After subtraction of the other backgrounds using
simulation, a value of $r_{\mathrm{OS/SS}}^{\mathrm{QCD}}$ = […] ± 0.01 (stat.) ± 0.09 (syst.) is obtained. The observed
deviation of $r_{\mathrm{OS/SS}}^{\mathrm{QCD}}$ from unity is taken into account as a systematic uncertainty for the final
result.

The $Z/\gamma^{*} \to \tau\tau$ contribution is estimated from the $\tau$-embedded $Z/\gamma^{*} \to \mu\mu$ sample in the
data, as described previously. For the $W$+jets background, a significant deviation of the ratio
of OS and SS events ($r_{\mathrm{OS/SS}}^{W}$) from unity is expected since $W$+jets production is dominated
by $gu$/$gd$ processes that often give rise to a jet originating from a quark, the charge of which is
anti-correlated with the $W$ boson charge. The predicted number of $W$+jets background events
is obtained from the simulation after applying a normalisation correction factor determined
from $W$-dominated data control samples.$^{6}$ The remaining contributions $n_{\mathrm{OS-SS}}^{\mathrm{other}}$ are taken
from the simulation.

Table 2 displays the number of events expected and observed in the seven categories
after the full signal selection, including all systematic uncertainties as described in Section 7.
The estimated combined background contributions are found to give a good description of all
quantities relevant to the analysis. As examples, the distributions of $E_{\mathrm{T}}^{\mathrm{miss}}$, the transverse
mass of the lepton-$E_{\mathrm{T}}^{\mathrm{miss}}$ system as well as the invariant mass of the two leading jets and
their pseudorapidity difference are shown in Figure 4. Figure 5 shows the corresponding
$\tau\tau$ invariant mass spectra, where the electron and muon categories have been combined for
illustration purposes. The data are found to be consistent with the estimated combined
background contributions in both normalisation and shape within the uncertainties.
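
As a cross-check of the OS background decomposition described above, the short Python sketch below adds the four terms for the $H$+2-jet VBF category, using the yields quoted in Table 2. Combining the per-term uncertainties in quadrature is an assumption made here for illustration only (it ignores any correlations between the terms), and the function name `os_background` is hypothetical.

```python
import math

def os_background(n_ss_all, n_w, n_z, n_other):
    """Sum the terms of the OS background estimate.

    Each argument is a (value, stat, syst) tuple. Uncertainties are
    combined in quadrature, i.e. the terms are ASSUMED to be uncorrelated.
    """
    terms = [n_ss_all, n_w, n_z, n_other]
    total = sum(t[0] for t in terms)
    stat = math.sqrt(sum(t[1] ** 2 for t in terms))
    syst = math.sqrt(sum(t[2] ** 2 for t in terms))
    return total, stat, syst

# H+2-jet VBF column of Table 2: n_SS^all, W+jets, Z->tautau, other
total, stat, syst = os_background((45, 7, 9), (5, 1, 2), (54, 6, 8), (20, 3, 5))
print(f"n_OS^bkg = {total} +- {stat:.1f} (stat.) +- {syst:.1f} (syst.)")
```

For this category the sum gives 124 events, which agrees with the total background listed in Table 2; the quadrature sums also round to the quoted ±10 and ±13, although that agreement should not be read as a statement about how the uncertainties were combined in the analysis.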



6 These control samples are defined by replacing the $m_{\mathrm{T}}$ < 30 GeV requirement in the nominal selection
with $m_{\mathrm{T}}$ > 50 GeV.
Table 2. Predicted number of signal events (for $m_H$ = 120 GeV) and predicted backgrounds obtained as described in the text, together
with the observed number of events in data for the $H \to \tau_{\mathrm{lep}}\tau_{\mathrm{had}}$ categories. The total background yield predicted by the alternative
estimation method is given as well for comparison. The listed uncertainties are statistical and systematic, in that order.

| | $H$+0-jet (low $E_{\mathrm{T}}^{\mathrm{miss}}$), Electron | $H$+0-jet (low $E_{\mathrm{T}}^{\mathrm{miss}}$), Muon | $H$+0-jet (high $E_{\mathrm{T}}^{\mathrm{miss}}$), Electron | $H$+0-jet (high $E_{\mathrm{T}}^{\mathrm{miss}}$), Muon |
|---|---|---|---|---|
| $ggH$ signal | 11 ± 1 ± 2 | 17 ± 1 ± 4 | 7.1 ± 0.8 ± 1.5 | 9.8 ± 0.9 ± 2.1 |
| VBF $H$ signal | 0.08 ± 0.02 ± 0.12 | 0.11 ± 0.03 ± 0.03 | 0.09 ± 0.02 ± 0.02 | 0.14 ± 0.03 ± 0.03 |
| $VH$ signal | 0.07 ± 0.02 ± 0.05 | 0.10 ± 0.03 ± 0.01 | 0.08 ± 0.02 ± 0.01 | 0.08 ± 0.02 ± 0.01 |
| $n_{\mathrm{SS}}^{\mathrm{all}}$ | (3.3 ± 0.2 ± 0.7) $\cdot 10^{3}$ | (2.0 ± 0.1 ± 0.4) $\cdot 10^{3}$ | (0.69 ± 0.06 ± 0.14) $\cdot 10^{3}$ | (0.47 ± 0.04 ± 0.09) $\cdot 10^{3}$ |
| $n_{\mathrm{OS-SS}}^{W+\mathrm{jets}}$ | (0.33 ± 0.02 ± 0.04) $\cdot 10^{3}$ | (0.50 ± 0.02 ± 0.07) $\cdot 10^{3}$ | (0.15 ± 0.01 ± 0.02) $\cdot 10^{3}$ | (0.18 ± 0.01 ± 0.03) $\cdot 10^{3}$ |
| $n_{\mathrm{OS-SS}}^{Z \to \tau\tau}$ | (3.70 ± 0.06 ± 0.61) $\cdot 10^{3}$ | (7.29 ± 0.06 ± 1.21) $\cdot 10^{3}$ | (1.49 ± 0.04 ± 0.23) $\cdot 10^{3}$ | (2.80 ± 0.04 ± 0.42) $\cdot 10^{3}$ |
| $n_{\mathrm{OS-SS}}^{\mathrm{other}}$ | (0.97 ± 0.04 ± 0.22) $\cdot 10^{3}$ | (0.59 ± 0.04 ± 0.14) $\cdot 10^{3}$ | (0.27 ± 0.02 ± 0.08) $\cdot 10^{3}$ | (0.14 ± 0.02 ± 0.04) $\cdot 10^{3}$ |
| Total background | (8.2 ± 0.2 ± 0.8) $\cdot 10^{3}$ | (10.4 ± 0.2 ± 1.2) $\cdot 10^{3}$ | (2.59 ± 0.07 ± 0.26) $\cdot 10^{3}$ | (3.59 ± 0.06 ± 0.43) $\cdot 10^{3}$ |
| Observed data | 8363 | 10911 | 2545 | 3570 |
| Altern. estimate | (8.7 ± 0.1 ± 0.8) $\cdot 10^{3}$ | (10.7 ± 0.1 ± 1.0) $\cdot 10^{3}$ | (2.76 ± 0.05 ± 0.33) $\cdot 10^{3}$ | (3.75 ± 0.05 ± 0.47) $\cdot 10^{3}$ |

| | $H$+1-jet, Electron | $H$+1-jet, Muon | $H$+2-jet VBF, Electron + Muon |
|---|---|---|---|
| $ggH$ signal | 8.1 ± 0.7 ± 1.6 | 10.8 ± 0.8 ± 2.2 | 0.9 ± 0.2 ± 0.3 |
| VBF $H$ signal | 1.6 ± 0.1 ± 0.1 | 1.9 ± 0.1 ± 0.1 | 2.2 ± 0.1 ± 0.2 |
| $VH$ signal | 1.1 ± 0.1 ± 0.1 | 1.4 ± 0.1 ± 0.1 | 0.02 ± 0.01 ± 0.01 |
| $n_{\mathrm{SS}}^{\mathrm{all}}$ | (0.93 ± 0.07 ± 0.19) $\cdot 10^{3}$ | (0.49 ± 0.04 ± 0.10) $\cdot 10^{3}$ | 45 ± 7 ± 9 |
| $n_{\mathrm{OS-SS}}^{W+\mathrm{jets}}$ | (0.25 ± 0.01 ± 0.03) $\cdot 10^{3}$ | (0.26 ± 0.01 ± 0.03) $\cdot 10^{3}$ | 5 ± 1 ± 2 |
| $n_{\mathrm{OS-SS}}^{Z \to \tau\tau}$ | (1.23 ± 0.03 ± 0.17) $\cdot 10^{3}$ | (1.76 ± 0.03 ± 0.25) $\cdot 10^{3}$ | 54 ± 6 ± 8 |
| $n_{\mathrm{OS-SS}}^{\mathrm{other}}$ | (0.28 ± 0.02 ± 0.04) $\cdot 10^{3}$ | (0.24 ± 0.01 ± 0.03) $\cdot 10^{3}$ | 20 ± 3 ± 5 |
| Total background | (2.69 ± 0.08 ± 0.26) $\cdot 10^{3}$ | (2.75 ± 0.05 ± 0.27) $\cdot 10^{3}$ | 124 ± 10 ± 13 |
| Observed data | 2610 | 2711 | 122 |
| Altern. estimate | (2.63 ± 0.05 ± 0.25) $\cdot 10^{3}$ | (2.72 ± 0.04 ± 0.28) $\cdot 10^{3}$ | 131 ± 11 ± 23 |


Figure 4. (a) Missing transverse momentum, (b) transverse mass of the lepton-$E_{\mathrm{T}}^{\mathrm{miss}}$ system as well
as (c) invariant mass and (d) pseudorapidity difference of the two leading jets in the $H \to \tau_{\mathrm{lep}}\tau_{\mathrm{had}}$
channel. The preselection criteria and an $E_{\mathrm{T}}^{\mathrm{miss}}$ > 20 GeV requirement are applied except those for
the displayed quantity; for (c) and (d) the presence of at least two jets with $p_{\mathrm{T}}$ > 25 GeV is required
in addition. Simulated samples are normalised to an integrated luminosity of 4.7 fb$^{-1}$. The selected
events in data are shown together with the predicted Higgs boson signal ($m_H$ = 120 GeV) stacked
above the background contributions (see text). For illustration only, the signal contributions have
been scaled by factors given in the legends.

[Figure 5 panels: (a) combined $H$+0-jet (low $E_{\mathrm{T}}^{\mathrm{miss}}$), signal scaled by 50; (b) combined $H$+0-jet
(high $E_{\mathrm{T}}^{\mathrm{miss}}$), signal scaled by 30; (c) combined $H$+1-jet, signal scaled by 10; (d) $H$+2-jet VBF,
signal scaled by 5. Each panel shows the MMC mass $m_{\tau\tau}$ [GeV] for data, the $H(120) \to \tau\tau$ signal,
$Z \to \tau\tau$ (OS-SS), others (OS-SS), $W$+jets (OS-SS), same-sign events and the background
uncertainty; $\int L\,dt$ = 4.7 fb$^{-1}$, $\sqrt{s}$ = 7 TeV.]

Figure 5. MMC mass (defined in Section 5) distributions of the selected events in the $H \to \tau_{\mathrm{lep}}\tau_{\mathrm{had}}$
channel. The corresponding electron and muon categories for the $H$+0-jet and $H$+1-jet categories are
shown combined here, while in the data analysis they are considered separately. The selected events
in data are shown together with the predicted Higgs boson signal ($m_H$ = 120 GeV) stacked above the
background contributions (see text). For illustration only, the signal contributions have been scaled
by factors given in the legends.

In order to validate these results, an alternative background estimation is performed.
This second method provides an estimate of the multi-jet background using control samples
that are defined by separately inverting the light-lepton isolation and the $\tau_{\mathrm{had}}$-$\ell$ charge
correlation requirements, respectively. In addition to the signal region (A) this results in three
background-dominated regions B (SS $\ell\tau_{\mathrm{had}}$, $\ell$ isolated), C (OS $\ell\tau_{\mathrm{had}}$, $\ell$ anti-isolated) and D
(SS $\ell\tau_{\mathrm{had}}$, $\ell$ anti-isolated). The shape of the MMC mass distribution for multi-jet contributions
in the signal region A is taken from the control region B and the normalisation is derived
from the relation $n_A = n_C / n_D \times n_B$. Here, $n_B$, $n_C$ and $n_D$ denote the event yields in regions
B, C and D, after subtracting the contribution from non-multi-jet backgrounds in all control
regions. The $Z/\gamma^{*} \to \tau\tau$ contribution is estimated from the $\tau$-embedded $Z/\gamma^{*} \to \mu\mu$ data
and the estimate of the $W$+jets background is obtained from the simulation after applying
normalisation correction factors determined from $W$-dominated (high-$m_{\mathrm{T}}$) OS and SS data
control samples. The remaining contributions are obtained from the simulation. The resulting
total background estimates are also included in Table 2 and are found to be consistent
with both the observed number of data events and the predictions of the default background
estimate within the uncertainties.
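
The normalisation relation of this alternative method can be illustrated with a minimal Python sketch. The yields used below are invented placeholders chosen only to make the arithmetic easy to follow; they are not taken from the analysis, and the function name `abcd_multijet_yield` is hypothetical.

```python
def abcd_multijet_yield(n_b, n_c, n_d, other_b=0.0, other_c=0.0, other_d=0.0):
    """Multi-jet yield in signal region A from n_A = n_C / n_D * n_B.

    n_b, n_c, n_d are the observed yields in control regions B, C and D;
    other_* are the non-multi-jet contributions subtracted in each region.
    """
    b = n_b - other_b
    c = n_c - other_c
    d = n_d - other_d
    if d <= 0:
        raise ValueError("region D must have a positive multi-jet yield")
    return c / d * b

# Placeholder yields (NOT from the paper): B = 1000, C = 5500, D = 4200,
# with 100, 500 and 200 non-multi-jet events subtracted, respectively.
n_a = abcd_multijet_yield(1000, 5500, 4200, other_b=100, other_c=500, other_d=200)
print(n_a)  # 5000 / 4000 * 900
```

The ratio $n_C / n_D$ plays the role of an OS-to-SS transfer factor measured with anti-isolated leptons, which is then applied to the isolated SS yield in region B.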

Further studies of specific background contributions are performed by estimating the
probability to misidentify jets as $\tau_{\mathrm{had}}$ candidates in the signal region and using data control
regions for the background from $t\bar{t}$ production processes. Results are consistent with the
background estimates in Table 2 in both cases.

6.3 $H \to \tau_{\mathrm{had}}\tau_{\mathrm{had}}$

The dominant backgrounds in the $H \to \tau_{\mathrm{had}}\tau_{\mathrm{had}}$ channel are $Z/\gamma^{*} \to \tau\tau$ and multi-jet
production. For both, the normalisation and shape of the mass distribution are estimated using
data-driven methods.

The normalisation of the $Z/\gamma^{*} \to \tau\tau$ background is obtained by using collision data events
at an early stage of the event selection. This data-driven control sample is defined by requiring
that events contain two $\tau_{\mathrm{had}}$ candidates that pass the hadronic selections including the
collinear approximation cuts ($0 < x_{1,2} < 1$ and $\Delta R(\tau\tau) < 2.8$). To avoid signal contamination
in this control sample, a requirement that $m_{\tau\tau}$ < 100 GeV is applied; this results in a SM
Higgs signal contamination of less than 0.2%. The $Z/\gamma^{*} \to \tau\tau$ contribution is obtained by
fitting the track multiplicity of the two $\tau$ candidates simultaneously. The tracks associated
to the $\tau_{\mathrm{had}}$ candidates are counted in the cone defined by $\Delta R < 0.6$, as motivated by the
momentum correlation between tracks in $\tau_{\mathrm{had}}$ candidates [56]. A two-dimensional distribution
of the track multiplicities for these two $\tau$ candidates is formed, and a track multiplicity fit
is performed. The multi-jet template is modelled from the same-sign (SS) candidates in the
data while the $Z/\gamma^{*} \to \tau\tau$ contribution is modelled by the simulation. The less significant
backgrounds are estimated from the simulation and subtracted before the fit is performed.
The result of the fit is used to normalise the $\tau$-embedded $Z/\gamma^{*} \to \mu\mu$ sample described above,
and then this sample is used to model the acceptance of the later cuts and the mass shape in
the signal region.

The multi-jet contribution is estimated by the same two-dimensional track multiplicity
fitting technique, where the fit is now performed in the signal region. The contribution from
di-$\tau$ events is allowed to float in the fit, where the shape comes from the simulation. It is
assumed that the shapes of the two-dimensional track multiplicity in the $Z/\gamma^{*} \to \tau\tau$ and
Higgs boson signal processes are the same.

The systematic uncertainties considered for the background prediction arise from the jet
template statistics, alternative multi-jet track multiplicity templates, the presence of non-di-$\tau$
background, signal contamination, charge misidentification probability and variations in the
pileup conditions.

Instead of using the SS events for the multi-jet track template, the alternative multi-jet
track multiplicity template is built from events with one additional electron or muon which
enhances the contribution of $W$+jets events, where two jets are misidentified as the hadronic
taus. This provides a different mix of quark- and gluon-initiated jets from the inclusive
multi-jet sample, addressing a possible flavour dependence of the track multiplicity distribution.

Figure 6. (a) $\Delta R$ of the two $\tau$ candidates, (b) missing transverse momentum, (c) collinear mass
and (d) invariant mass of the reconstructed Higgs boson and the leading jet in the $H \to \tau_{\mathrm{had}}\tau_{\mathrm{had}}$
control region. Expectations from the Higgs boson signal ($m_H$ = 120 GeV) and from backgrounds
are given. The error band indicates the total background uncertainty. For illustration only, the signal
contributions, which are expected to be small in this control region, have been scaled by factors given
in the legends.

The overall systematic uncertainties are 11.6% and 22% for $Z/\gamma^{*} \to \tau\tau$ and multi-jet
backgrounds, respectively.

The mass shape from the multi-jet events is modelled by dropping the opposite-sign and
$\tau_{\mathrm{had}}$ track multiplicity requirements. In order to obtain a data sample that is enriched with
the multi-jet background, events are accepted if they are same-sign events or if the sum of
the charges of the products of a single $\tau$ decay is 0 or ±2. This mass shape model is tested
against several other hypotheses obtained, e.g., from pure SS samples or events selected with
looser $\tau$ identification criteria.

Figure 6 shows the kinematic distributions of (a) the $\Delta R$ of the two $\tau$ candidates, (b)
the missing transverse momentum, (c) the collinear mass and (d) the invariant mass of the
reconstructed Higgs boson and the leading jet in the $H \to \tau_{\mathrm{had}}\tau_{\mathrm{had}}$ control region.

Table 3 presents the event yields after the full event selection, where the yields are
normalised to 4.7 fb$^{-1}$. The collinear mass distribution after full selection is presented in
Figure 7.

Table 3. Number of events after the $H \to \tau_{\mathrm{had}}\tau_{\mathrm{had}}$ selection in data and predicted number of
background events, for an integrated luminosity of 4.7 fb$^{-1}$. Predictions for the Higgs boson signal
($m_H$ = 120 GeV) are also given. Statistical errors and systematic uncertainties are quoted, in that
order.

| $H \to \tau_{\mathrm{had}}\tau_{\mathrm{had}}$ | $H$+1-jet |
|---|---|
| $gg \to H$ signal | 3.1 ± 0.2 ± 0.6 |
| VBF $H$ signal | 1.51 ± 0.05 ± 0.18 |
| $VH$ signal | 0.61 ± 0.04 ± 0.06 |
| $Z/\gamma^{*} \to \tau^{+}\tau^{-}$ | 287 ± 23 ± 34 |
| $W$+jets / $Z$+jets | 6.1 ± 1.3 ± 1.4 |
| $t\bar{t}$ + single top | 1.9 ± 0.3 ± 0.4 |
| $WW/WZ/ZZ$ | 2.1 ± 0.4 ± 0.4 |
| Multi-jet | 54 ± 21 ± 12 |
| Total background | 348 ± 31 ± 36 |
| Observed data | 317 |
Figure 7. Reconstructed $m_{\tau\tau}$ of the selected events in the $H \to \tau_{\mathrm{had}}\tau_{\mathrm{had}}$ channel. Expectations
from the Higgs boson signal ($m_H$ = 120 GeV) and from backgrounds are given. Results are shown
after all selection criteria (see text). For illustration only, the signal contribution has been scaled by
a factor given in the legend.



7 Systematic uncertainties 



Systematic uncertainties on the normalisation and shape of the signal and background mass
distributions are taken into account. These are treated either as fully correlated or
uncorrelated across categories. In the case of partial correlations, the source is separated into
correlated and uncorrelated components. The dominant correlated systematic uncertainties
are those on the measurement of the integrated luminosity and on the theoretical predictions
of the signal production cross sections and decay branching ratios, as well as those related to
detector response that impact the analyses through the reconstruction of electrons, muons,
hadronic $\tau$ decays, jets, $E_{\mathrm{T}}^{\mathrm{miss}}$ and $b$-tagging.

Theoretical uncertainties: The Higgs boson cross section, branching ratios and their
uncertainties are compiled in Refs. [65, 66]. The QCD scale uncertainties on the signal cross
sections depend on $m_H$ and are of the order of 1% for the VBF and $VH$ production modes
and in the range of 8-25% for $gg \to H$ depending on jet multiplicity [67, 68]. An uncertainty
of 4-5% is assumed for the inclusive cross section of the single vector boson and di-boson
production mechanisms and a relative uncertainty of 24% is added in quadrature per additional
jet. For both $t\bar{t}$ production and single top-quark production, the QCD scale uncertainties
are in the range of 3-6% [69-71]. The uncertainties related to the PDF amount to 8% for
the predominantly gluon-initiated processes, $gg \to H$ and $t\bar{t}$, and 4% for the predominantly
quark-initiated processes, VBF, $VH$, single vector boson and di-boson production [72-75].
The systematic uncertainty arising from the choice of different sets of PDF is included. In
addition to the theoretical errors considered in Ref. [66], other effects are taken into account.
Uncertainties related to hadronisation effects are estimated by replacing PYTHIA with
HERWIG. Effects due to initial and final state radiation are assessed with PYTHIA samples where
the gluon emission is changed according to Ref. [76]. The effect of a different choice of parton
shower and underlying event parametrisation yields a total uncertainty of about 10% on the
acceptance of the Higgs boson produced via the VBF mechanism in the $H$+2-jet VBF channel.
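
To illustrate how the per-jet term grows with jet multiplicity, the Python sketch below adds the 24% relative uncertainty in quadrature once per additional jet to the inclusive cross-section uncertainty. The choice of 5% for the inclusive value (the upper end of the quoted 4-5% range) and the function name are assumptions made for this example.

```python
import math

def vector_boson_xsec_uncertainty(n_extra_jets, inclusive=0.05, per_jet=0.24):
    """Relative cross-section uncertainty for a given number of additional jets.

    The per-jet term is added in quadrature once for each additional jet.
    inclusive=0.05 is an assumed value from the quoted 4-5% range.
    """
    return math.sqrt(inclusive ** 2 + n_extra_jets * per_jet ** 2)

for n_jets in range(3):
    print(n_jets, f"{vector_boson_xsec_uncertainty(n_jets):.3f}")
```

Under these assumptions the relative uncertainty is 5% with no additional jet, about 25% with one and about 34% with two, so the per-jet term dominates as soon as a jet is required.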

Detector-related uncertainties: The uncertainty on the integrated luminosity is con- 
sidered as fully correlated across all analysis categories and amounts to 3.9% [77, 78]. The 
effect of pileup on the signal and background expectations is modelled in the Monte Carlo 
simulations and the corresponding uncertainty is taken into account. 

Appropriate scale factors for the trigger efficiencies of electron, muon and hadronic $\tau$
triggers are obtained from data and applied to the Monte Carlo simulations. The associated
systematic uncertainties are typically 1-2%. Differences between data and Monte Carlo
simulations in the reconstruction and identification efficiencies of $\tau$ leptons, electrons and muons
are taken into account, as well as the differences in the momentum scales and resolutions.

The systematic uncertainties on the hadronic $\tau$ decay identification efficiency are
estimated from data samples enriched in $W \to \tau\nu$ and $Z/\gamma^{*} \to \tau\tau$ events and they are less than
4%. The energy scale uncertainties on the hadronic $\tau$ and jets are evaluated based on the
single hadron response in the calorimeters [79]. In addition, the $\tau_{\mathrm{had}}$ energy scale is validated
in data samples enriched in $Z/\gamma^{*} \to \tau\tau$ events. The systematic uncertainties related to the
$\tau$ and jet energy scale, resolution and $b$-veto are modelled as functions of $\eta$ and $p_{\mathrm{T}}$. The jet
and $\tau$ energy scale and resolution uncertainties are treated as correlated and propagated to
the $E_{\mathrm{T}}^{\mathrm{miss}}$ calculation. Uncertainties associated with the remaining pileup noise and cluster
activity in the calorimeters are also considered as independent $E_{\mathrm{T}}^{\mathrm{miss}}$ uncertainties.

The detector-related uncertainties depend on the event topology and are typically small
compared to the theoretical uncertainties. The main exceptions are the jet energy scale
uncertainty, which reaches up to 12%, and the $\tau$ energy scale uncertainty, which is in the
range 2-5%.

Background modelling uncertainties: The modelling of the $Z/\gamma^{*} \to \tau\tau$ background is
performed with the data, as described in Section 2. Corresponding uncertainties are obtained
by propagating variations of the $Z/\gamma^{*} \to \mu\mu$ event selection and the muon energy subtraction
procedure through the $\tau$-embedding procedure. Backgrounds with misreconstructed leptons
and $\tau_{\mathrm{had}}$ candidates are estimated with data and the uncertainty in the estimation lies in the
range of 6-40%. The uncertainty takes into account the dependence on the number of jets.
The treatment of the other background processes varies across channels and the uncertainties
related to the modelling are taken into account as described in Section 6.

8 Statistical analysis 

The statistical analysis of the data employs a binned likelihood function constructed as the
product of the likelihood terms for each category. A total of twelve categories are considered
from the $H \to \tau_{\mathrm{lep}}\tau_{\mathrm{lep}}$, $H \to \tau_{\mathrm{lep}}\tau_{\mathrm{had}}$ and $H \to \tau_{\mathrm{had}}\tau_{\mathrm{had}}$ channels. The likelihood in each
category is a product over bins in the distributions of the MMC mass, collinear mass or
effective mass shown in Figs. 3, 5 and 7.

The expected signal and background, as well as the observed number of events, in each bin
of the mass distributions enter in the definition of the likelihood function $\mathcal{L}(\mu, \theta)$. A "signal
strength" parameter ($\mu$) multiplies the expected signal in each bin. The signal strength
is a free parameter in the fit procedure. The value $\mu = 0$ ($\mu = 1$) corresponds to the
absence (presence) of a Higgs boson signal with the SM production cross-section. Signal and
background predictions ($s$ and $b$) depend on systematic uncertainties that are parametrised
by nuisance parameters $\theta$, which in turn are constrained using Gaussian functions. The
correlations of the systematic uncertainties across categories are taken into account:

$$ \mathcal{L}(\mu, \theta) = \prod_{\mathrm{bin}\ j} \mathrm{Poisson}(N_j \mid \mu s_j + b_j) \prod_{\theta} \mathrm{Gaussian}(t \mid \theta, 1). $$

The expected signal and background event counts in each bin are functions of $\theta$. The
parametrisation is chosen such that the rates in each channel are log-normally distributed
for a normally distributed $\theta$.
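
The structure of such a likelihood can be sketched in a few lines of Python. This is a deliberately reduced toy, not the analysis implementation: it assumes a single category, a single nuisance parameter `theta` shared by signal and background, and a log-normal response of the form (1 + sigma)^theta; the bin contents, the relative uncertainties `sigma_s` and `sigma_b`, and all function names are invented for illustration.

```python
import math

def log_poisson(n, lam):
    """ln Poisson(n | lam) for an observed count n and expectation lam > 0."""
    return n * math.log(lam) - lam - math.lgamma(n + 1)

def negative_log_likelihood(mu, theta, observed, signal, background,
                            sigma_s=0.10, sigma_b=0.05):
    """-ln L(mu, theta) for one category with one nuisance parameter.

    Rates scale as (1 + sigma) ** theta, so they are log-normally
    distributed when theta follows a unit Gaussian (toy assumption).
    """
    s_scale = (1.0 + sigma_s) ** theta
    b_scale = (1.0 + sigma_b) ** theta
    nll = 0.0
    for n, s, b in zip(observed, signal, background):
        nll -= log_poisson(n, mu * s * s_scale + b * b_scale)
    # Unit-width Gaussian constraint on theta, including its constant term
    nll += 0.5 * theta ** 2 + 0.5 * math.log(2.0 * math.pi)
    return nll

# Invented three-bin example (NOT analysis data)
observed = [12, 9, 4]
signal = [2.0, 3.0, 1.0]
background = [10.0, 8.0, 5.0]
nll_b = negative_log_likelihood(0.0, 0.0, observed, signal, background)
nll_sb = negative_log_likelihood(1.0, 0.0, observed, signal, background)
print(nll_b, nll_sb)
```

In the full analysis there is one such product of bins per category and many nuisance parameters, with correlated sources sharing the same parameter across categories.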

The test statistic is defined as:

$$ q_{\mu} = -2 \ln \left( \frac{\mathcal{L}(\mu, \hat{\hat{\theta}}_{\mu})}{\mathcal{L}(\hat{\mu}, \hat{\theta})} \right), $$

where $\hat{\mu}$ and $\hat{\theta}$ refer to the global maximum of the likelihood (with the constraint $0 \le \hat{\mu} \le \mu$)
and $\hat{\hat{\theta}}_{\mu}$ corresponds to the conditional maximum likelihood of $\theta$ for a given $\mu$. This test
statistic is used to compute exclusion limits following the modified frequentist method known
as $CL_s$ [80]. The asymptotic approximation [81] is used to evaluate the Gaussian probability
density functions rather than performing pseudo-experiments and the procedure has been
validated using ensemble tests.
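
The role of the constraint on the fitted signal strength can be seen in a simplified Python sketch of the test statistic. To keep the example short it omits nuisance parameters altogether, so the profiling over $\theta$ is not shown, and it locates the maximum with a plain grid scan; the bin contents and function names are invented for illustration.

```python
import math

def nll(mu, observed, signal, background):
    """-ln L(mu) for a binned Poisson model without nuisance parameters."""
    total = 0.0
    for n, s, b in zip(observed, signal, background):
        lam = mu * s + b
        total -= n * math.log(lam) - lam - math.lgamma(n + 1)
    return total

def q_mu(mu, observed, signal, background, steps=2000):
    """q_mu = -2 ln [L(mu) / L(mu_hat)] with mu_hat restricted to [0, mu]."""
    # Grid scan over the allowed range; mu itself is appended so that
    # the minimum can never exceed the value at the tested mu.
    grid = [mu * i / steps for i in range(steps + 1)] + [mu]
    nll_hat = min(nll(m, observed, signal, background) for m in grid)
    return 2.0 * (nll(mu, observed, signal, background) - nll_hat)

# Invented two-bin example (NOT analysis data)
observed = [12, 9]
signal = [2.0, 3.0]
background = [10.0, 8.0]
for mu in (0.0, 1.0, 2.0, 5.0):
    print(mu, q_mu(mu, observed, signal, background))
```

Because the fitted signal strength is not allowed to exceed the tested value, an upward fluctuation of the data beyond the tested hypothesis gives a test statistic of zero and therefore does not count as evidence against that hypothesis.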

The profile likelihood formalism used in this statistical analysis incorporates the
information on the observed and expected number of events, nuisance parameters, probability density
functions and parameters of interest. The statistical significance of an excess is evaluated in
terms of the same profile likelihood test statistic. The expected sensitivity and the $\pm 1\sigma$ and $\pm 2\sigma$
bands are obtained for the background expectation in the absence of a Standard Model Higgs
boson signal. The consistency with the background-only hypothesis is quantified using the
$p$-value, the probability that the test statistic of a background-only experiment fluctuates to
at least the observed one.

9 Results 

No significant excess is observed in the data compared to the SM expectation in any of
the channels studied here. Exclusion limits at the 95% confidence level, normalised to the
Standard Model cross section times the branching ratio of $H \to \tau^{+}\tau^{-}$ ($\sigma_{\mathrm{SM}}$), are set as
a function of the Higgs boson mass. Figure 8 shows expected and observed limits for the
individual channels and for the combined result. The combined expected limits vary between
3.4 and 8.2 times the predicted Standard Model cross section times branching ratio for the
mass range 100-150 GeV. The most sensitive categories are the $H$+1-jet category in the
$\tau_{\mathrm{had}}\tau_{\mathrm{had}}$ channel, the $H$+2-jet VBF category in the $\tau_{\mathrm{lep}}\tau_{\mathrm{had}}$ channel and the $H$+2-jet VBF
category in the $\tau_{\mathrm{lep}}\tau_{\mathrm{lep}}$ channel. The observed limits are in the range between 2.9 and 11.7
times the predicted Standard Model cross section times branching ratio for the same mass
range. The most significant deviation from the background-only hypothesis is observed in the
combined limit for $m_H$ = 150 GeV with a local $p$-value of 10%, corresponding to a significance
of 1.3$\sigma$.
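
The conversion between a local $p$-value and a significance can be checked with the Python standard library. The sketch assumes the usual one-sided Gaussian convention, in which the significance is the standard normal quantile of one minus the $p$-value; the function name is hypothetical.

```python
from statistics import NormalDist

def significance(p_value):
    """One-sided Gaussian significance Z for a given p-value (0 < p < 1)."""
    return NormalDist().inv_cdf(1.0 - p_value)

z = significance(0.10)
print(f"Z = {z:.2f}")
```

With this convention a $p$-value of 10% corresponds to Z of about 1.28, which rounds to the quoted 1.3 standard deviations.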

10 Conclusions 

A search for a Higgs boson decaying in the $H \to \tau\tau$ channel has been performed with
the ATLAS detector at the Large Hadron Collider. It uses the full 2011 data sample of
4.7 fb$^{-1}$ collected at a centre-of-mass energy of 7 TeV. The $H \to \tau_{\mathrm{lep}}\tau_{\mathrm{lep}}$, $H \to \tau_{\mathrm{lep}}\tau_{\mathrm{had}}$ and
$H \to \tau_{\mathrm{had}}\tau_{\mathrm{had}}$ decays are considered in this search. No significant excess is observed in the
mass range of 100-150 GeV. The observed (expected) upper limits on the cross section times
the branching ratio of $H \to \tau\tau$ are between 2.9 (3.4) and 11.7 (8.2) times the SM prediction.
These limits are similar to results recently reported by the CMS experiment [82].
Figure 8. Observed (solid) and expected (dashed) 95% confidence level upper limits on the Higgs
boson cross section times branching ratio, normalised to the Standard Model expectation, as a function
of the Higgs boson mass. Expected limits are given for the scenario with no signal. The bands around
the dashed line indicate the expected statistical fluctuations of the limit. Results are given for the
$H \to \tau_{\mathrm{lep}}\tau_{\mathrm{lep}}$, $H \to \tau_{\mathrm{lep}}\tau_{\mathrm{had}}$ and $H \to \tau_{\mathrm{had}}\tau_{\mathrm{had}}$ channels independently and for all channels combined.